Speech E Ha Ceme T Usi G Soft Thresholdi G with Dct-emd Based Hybrid Algorithm

نویسندگان

  • Erhan Deger
  • Md. K. Islam Molla
  • Keikichi Hirose
  • Md. Kamrul Hasan
چکیده

This paper introduces a new speech enhancement method using soft thresholding with a Discrete Cosine Transform (DCT) and Empirical Mode Decomposition (EMD) based hybrid algorithm. Soft thresholding for DCT-enhancement is a powerful method for enhancing the noisy speech signal in a wide range of signal-to-noise ratios (S Rs). However, due to the thresholding criteria a significant amount of noise is left in the enhanced signal. EMD is applied here to remove the remaining noise components. Due to the frequency characteristics of the intrinsic mode functions (IMFs), the noise components are mainly centered in the lower order IMFs. Therefore, it is possible to successfully identify and remove the remaining noise. The experimental results show that the proposed hybrid method is significantly more effective in removing the noise components from the noisy speech signal; thus giving better results in output S R and quality compared to recently reported techniques.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of Gain in LD-CELP Using Hybrid Genetic/PSO-Neural Models

In this paper, the gain in LD-CELP speech coding algorithm is predicted using three neural models, that are equipped by genetic and particle swarm optimization (PSO) algorithms to optimize the structure and parameters of neural networks. Elman, multi-layer perceptron (MLP) and fuzzy ARTMAP are the candidate neural models. The optimized number of nodes in the first and second hidden layers of El...

متن کامل

Prediction of Gain in LD-CELP Using Hybrid Genetic/PSO-Neural Models

In this paper, the gain in LD-CELP speech coding algorithm is predicted using three neural models, that are equipped by genetic and particle swarm optimization (PSO) algorithms to optimize the structure and parameters of neural networks. Elman, multi-layer perceptron (MLP) and fuzzy ARTMAP are the candidate neural models. The optimized number of nodes in the first and second hidden layers of El...

متن کامل

Image Reco Structio a D Resolutio E Ha Ceme T Algorithm for Fmcw Medical Ultrasou D Imagi G Systems

This paper presents an overview of the graduate research program on the design and development of the high-performance image reconstruction and resolution enhancement algorithm for an advanced medical ultrasound imaging system. The data acquisition is conducted with a micro flexible transceiver array, operating in the multi-static steppedfrequency FMCW mode. The objective of this system is to p...

متن کامل

Dct Based Pitch Modification

In this paper, we propose a novel algorithm for pitch modification. The linear prediction residual is obtained from pitch synchronous frames by inverse filtering the speech signal. Then Discrete Cosine Transform (DCT) is applied on these pitch synchronous frames. Based on the desired factor of pitch modification, the dimension of the DCT vector is changed by truncation or zero padding, and then...

متن کامل

Empirical Mode Decomposition for Advanced Speech Signal Processing

Empirical mode decomposition (EMD) is a newly developed tool to analyze nonlinear and non-stationary signals. It is used to decompose any signal into a finite number of time varying subband signals termed as intrinsic mode functions (IMFs). Such data adaptive decomposition is recently used in speech enhancement. This study presents the concept of EMD and its application to advanced speech signa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007